Pitch declination and reset as a function of utterance duration in conversational speech data

نویسندگان

  • Céline De Looze
  • Irena Yanushevskaya
  • Andy Murphy
  • Eoghan O'Connor
  • Christer Gobl
چکیده

This paper describes the declination trends of f0 in conversational speech data. A 10-minute dialogue interaction from a corpus of spontaneous speech was annotated to identify intersilence units (ISU) and turns. Detailed annotation of the ISUs was conducted in terms of communicative types and pitch patterns. f0 declination was measured by (1) fitting a regression line to f0 trajectories and (2) by fitting additional regression lines to the data points below and above the original (central) regression line. The slope of declination as well as the height of ISU/turn-initial f0 peak were examined as a function of the duration of the ISU or turn. The results suggest that declination is indeed present in conversational speech data, at the level of both the ISU and the turn (73% of the analysed ISUs exhibited negative f0 declination slope). There is a tendency for the steepness of the slope to decrease and the height of ISturn-initial f0 peak to increase as the duration of the ISU or turn increases. The results are discussed in the context of Projection and Reaction theories and of Hard vs. Soft preplanning of speech production. The findings are of potential interest for the development of human-machine dialogue systems.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Acoustic Study of Emotivity-Prosody Interface in Persian Speech Using the Tilt Model

This paper aims to explore some acoustic properties (i.e. duration and pitch amplitude of speech) associated with three different emotions: anger, sadness and joy against neutrality as a reference point, all being intentionally expressed by six Persian speakers. The primary purpose of this study is to find out if there is any correspondence between the given emotions and prosody patterning in P...

متن کامل

Analysis of factors involved in the choice of rising or non-rising intonation in question utterances appearing in conversational speech

In general, the end of question utterances is accompanied by a rising intonation. However, non-rising intonation is commonly observed in question utterances appearing in conversational speech. In order to clarify the factors involved in the choice of rising or non-rising intonation, in the present work, we analyzed question utterances extracted from Japanese conversational dialogue speech data ...

متن کامل

Word segmentation in Persian continuous speech using F0 contour

Word segmentation in continuous speech is a complex cognitive process. Previous research on spoken word segmentation has revealed that in fixed-stress languages, listeners use acoustic cues to stress to de-segment speech into words. It has been further assumed that stress in non-final or non-initial position hinders the demarcative function of this prosodic factor. In Persian, stress is retract...

متن کامل

The Function of Pitch Range Variations in Samples of Emotional Expressions in Persian

This study aims at investigating the interface between emotion and intonation patterns (more specifically, duration and pitch amplitude of speech). To this end, the acoustic properties of spectral parameters related to speech prosody are investigated. The results of acoustic and Statistical analysis show that mean level and range of FO in the contours vary strongly as a function of the degree o...

متن کامل

Combined Use of Speaker- and Tone-Normalized Pitch Reset with Pause Duration for Automatic Story Segmentation in Mandarin Broadcast News

This paper investigates the combined use of pause duration and pitch reset for automatic story segmentation in Mandarin broadcast news. Analysis shows that story boundaries cannot be clearly discriminated from utterance boundaries by speaker-normalized pitch reset due to its large variations across different syllable tone pairs. Instead, speakerand tonenormalized pitch reset can provide a clear...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015